SPARSE REGULAR RANDOM GRAPHS: SPECTRAL DENSITY AND 

EIGENVECTORS 
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Abstract. We examine the empirical distribution of the eigenvalues and the eigenvectors of adjacency 
matrices of sparse regular random graphs. We find that when the degree sequence of the graph slowly 
increases to infinity with the number of vertices, the empirical spectral distribution converges to the semicircle 
law. Moreover, we prove concentration estimates on the number of eigenvalues over progressively smaller 
intervals. We also show that, with high probability, all the eigenvectors are delocalized. 



1. Introduction 

Consider the uniform distribution over the space of all labeled simple graphs on n vertices where every 
vertex has degree d. We denote a graph randomly selected from this distribution by G(n, d); the vertices of 
G(n, d) will always be labeled by {1, 2, . . . , n}. 

Now, consider a sequence of such random graphs {G(n, d„), n € N} which are (i„-regular of order n. We 
assume d n to be slowly growing with n in a manner which will be made more precise later. Consider the 
adjacency matrix A n of G(n,d n ); the (i,j)th element of A n is one or zero depending on whether there is 
an edge between vertices i and j in the graph G(n,d n )- The random matrix A n is always symmetric, and 
it has n real eigenvalues (perhaps not all distinct) and corresponding real eigenspaces. Under appropriate 
conditions on the growth of the sequence d n , we study the following phenomena as n tends to infinity: 

(i) Global semicircle law. We prove the convergence of the empirical spectral distribution (ESD) of the 
scaled adjacency matrix to the probability measure on [—2, 2] with density 

(1) fsc(x) = ^-V±~x 2 , ~2<x<2. 

(ii) Local semicircle law (a.k.a. the semicircle law on short scales). We obtain concentration estimates of 
the deviation of the number of eigenvalues Mi that lie in a small interval / from its predicted number 
n J T f sc (x)dx. The size of / will be taken to be vanishing at an appropriate rate with increasing n. 

(iii) Delocalization of eigenvector coordinates. We obtain probability estimates of the event that, for 
some eigenvector, a few of the coordinates are significantly larger in magnitude than the rest. 

These problems connect two areas of study: Wigner random matrices and spectra of sparse random 
graphs; since we have already mentioned the latter, we will now talk about the former. 

In the last couple of decades there has been an enormous amount of activity in the study of universal 
properties of random matrices, inspired by their connection to (universal) physical systems. The literature 
on universality studies in random matrices is vast; we mention here only a few references and ask the reader 
to look to them for further ones. 

For an introduction and motivation to the subject, we recommend Deift's ICM address [TS]. Of particular 
interest are the Wigner matrices (see Bai [3] , Soshnikov [SS] , Bai and Yao , Khorunzhy, Khoruzhenko, and 
Pastur [37] , Guionnet and Zeitouni [35 , Ben Arous and Peche [10] , Tao and Vu [60] [61] . Deift and Goiev 
[19] looked at different potential functions on symmetric, hermitian, and self-dual matrices; Baik and Suidan 
explored connections to percolation [7] and random walks [5] ; /3-generalizations of the classical ensembles and 
universal properties thereof have been explored in Forrester and Baker (30] . Johansson [36] . Dumitriu and 
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Edelman [21,. Recently, more sophisticated probability tools have been generalized and applied to random 
matrix theory (e.g. the Lindeberg Principle, by Chatterjee [TS] and Tao and Vu [t)Ul 161) ). 

Most of the focus in universality research has been on proving, under progressively weaker assumptions 
on the entry distribution, the following: 

- convergence of the ESD to the semicircle or Marcenko-Pastur laws and establishing rates of con- 
vergence in various ways (large deviations, concentration estimates, central limit theorems). For a 
comprehensive treatment of the subject, see the books by Bai and Silverstein [SJ and by Anderson, 
Guionnet, and Zeitouni [2J; 

- fluctuations of the spectrum at the edge (the famous Tracy- Widom laws [62J [63j [64]) for general 
Wigner matrices, settled by Tao and Vu [61] and Erdos, Ramirez, Schlein, Tao, Vu, and Yau [SJJ; 

- universality of correlation functions in the bulk, under various assumptions (Tao and Vu |60j . Erdos, 
Ramirez, Schlein, Tao, Vu, and Yau [57] , Erdos, Peche, Ramirez, Schlein, and Yau [55], Erdos, Yau, 
and Yin [SJ; 

- partial or complete derealization of the eigenvectors (by Erdos, Schlein, and Yau [24], [25], and Tao 
and Vu [60], [ST]). 

The aim of said research has been to show that the spectral statistics agree in the large n limit to the 
spectral statistics of the Gaussian Orthogonal Ensemble (GOE) and the Gaussian Unitary Ensemble (GUE), 
depending on whether the matrices are real symmetric/positive-definite, or complex hermitian/positive def- 
inite. The Gaussian and Wishart ensembles are some of the most studied and best understood random 
matrix models; for an easy introduction to classical random matrix theory, see the books by Mehta |49j and 
Muirhead [53]. 

Parallel to these developments, in combinatorics and discrete mathematics, there has always been an 
interest in studying spectral properties of deterministic and random graphs. There are two matrices of 
interest in spectral graph theory: the adjacency (sometimes called the incidence) matrix, which we already 
defined, and the Laplacian matrix. These matrices are the same for regular graphs (although, in general, 
they can be quite different), and the spectrum of the graph is the spectrum of the matrix. 

Among the properties of random graphs that have been the focus of intense research are connectivity, 
phase transitions, and the limiting spectral distribution of random graphs, including trees (McKay 05], Feige 
and Ofek [29 , Mirlin and Fyodorov 52J, Bauer and Golinelli [9], Semerjian and Gugliandolo [55], Bordenave 
and Lelarge [T3], Bhamidi, Evans, and Sen |llj). Other properties include concentration of eigenvalues 
(Krivelevich and Sudakov ([H], Alon, Krivelevich, and Vu [1]), the spectral gap (Furedi and Komlos 34 , 
Friedman [31] and Friedman and Alon [33] , Broder and Shamir |15|). 

Another area of recent interest is the study of quasi-random graphs and expanders. These are non- 
random graphs which display properties one expects to hold with high-probability for certain classes of 
random graph models. For example, expanders are sparse graphs that have high connectivity properties 
(e.g., a large spectral gap). These graphs are often regular (for example, the famous Ramanujan graph, 
described in the seminal articles by Lubotzky, Phillips, and Sarnak [43] and Morgenstern [51] ). Random 
d-regular graphs display the same connectivity properties with very high probability, when d is kept fixed 
and the order is large; this is in essence the Alon Conjecture, recently settled by Friedman 33 . Thus a 
study of random regular graphs suggests possible properties of (deterministic) expanders. 

It is easy for a probability audience to appreciate the importance of studying eigenvalues of the graph 
(e.g., the spectral gap which determines the mixing properties of a random walk), but eigenvectors of graphs 
are equally important, especially since they are the solutions of various combinatorial optimization problems. 
Traditionally, there has been much less work on computing the actual graph eigenvector distributions, with 
the notable and recent exception of Wishart-like sample covariance matrices (see Bai, Miao, and Pan [3]). 
Thus, developments in examining properties of the eigenvectors of large random graphs (as in Friedman 
[32] and Dekel, Lee, and Linial [20]) are relatively new, and motivated by the applications of eigenvectors 
to engineering and computer science. Such applications include the Google Page- Rank algorithm [14], the 
Shi-Malik algorithm [57], the Meila-Shi algorithm [50] and other spectral clustering techniques and related 
segmentation problems (Weiss [57J, Pothen, Simon, and Liou [S3], etc.). 

It is probably clear by now that the two fields of research that we have very briefly sketched here (uni- 
versality studies in random matrix theory and spectra of random graphs) are vast and, by examining the 
two lists of important problems we have outlined, one can see that there is a certain amount of overlap. 

2 



Naturally, this lead to a few papers where the two fields have intersected, despite differences in both the 
goals and the methodology of each. 

A famous such example is McKay's derivation of the limiting empirical spectrum of random d-regular 
graphs on n vertices, as d is fixed and n grows to infinity [45] . In that case, the empirical spectral distribution 
converges in probability to what is known as the McKay (or Kesten-McKay) law, which has a density 

(2) Hx) = ^^Jp , ~2Vd—l< X <2Vd—l. 

This density had appeared earlier in Kesten's work on random walks on groups |39j . It can be easily verified 
that as d grows to infinity, if we normalize the variable x in the above by y/d — 1, the resulting density 
converges to the semicircle law on [—2, 2]. 

This naturally raised the question of whether the study of "universal" properties could be pushed into the 
domain of regular random graphs with increasing degree. The rate of growth of the degree sequence plays 
an important role, since at both extremes (d fixed and d = n — 1) the ESDs do not converge to the semicircle 
law. 

The answer to this question turns out to be difficult. There are a number of major obstacles to developing 
an applicable universality theory in the spirit of Wigner random matrices to adjacency matrices of random 
graphs, which are non- Wigner: these matrices are sparse and the entries are not independently distributed. 

To see how sparsity affects concentration, consider the question of proper scaling of the adjacency matrix 
A n . In the Wigner case, the scaling factor is clearly 1/y/n, which puts all of the eigenvalues in [— 2 — e, 2 + e], 
for any positive e, with very high probability for a sufficiently large n. One might be tempted then to believe 
that the proper scaling for adjacency matrices is l/^/d^, as this achieves the same kind of finite row- variance 
as l/y/n does in the Wigner case. Unfortunately, it is not known if this scaling will place all the eigenvalues 
(except the first) in a compact interval, as d n — > oo. 

For the regime when d is fixed, the Alon conjecture states that the second largest eigenvalue (in absolute 
value) A2 has an upper bound | A2 1 < 2yd — 1 + e, with very high probability. The well-known lower 
bound holds for every d-regular graph and we cite it from Friedman [31]: | A2 1 > 2^/d— 1 + O (log d/log n). 
Unfortunately, when d grows with n, the upper bound is not known to hold outside of a narrow growth 
regim^] Khorunzhy |40j has shown that, for a random matrix model similar to the adjacency matrix of 
the Erdos-Renyi random graph on n vertices with an expected degree d n 3> log n, with probability one, 
the spectral norm of the adjacency matrix grows faster than \fd^. Although this does not necessarily affect 
convergence of the ESD to the semicircle law, it eliminates the possibility of containing all the eigenvalues 
of the rescaled centralized adjacency matrix within any compact interval. 

Our results investigate the extent to which universality can be extended to the slowly growing d n case. 
Our first result is Theorem [l] stated below. 

Theorem 1. Let d n satisfy the asymptotic condition 

(3) lim d n — 00, d n — 1 = n e ™ , for some e n = o(l). 

n— >oo 

Then the ESD of the matrix (d n — l) _1 ' 2 y4. n , where A n denotes the adjacency matrix of G n , converges in 
distribution to the semicircle law on [—2, 2] which has a density 

(4) f 8C (x) := -^4-^, -2<x<2. 

ZTT 

The condition on d n , for example, includes the logarithmic regime, d n = (logn) 7 for any positive 7; in 
which case we can define e n as 7 log logn/ logn. 

Our proof of this result (and the following ones) depends crucially on two facts: 

(i) the "locally tree-like" property, which states that with high probability, most vertices in a random 
regular graph will have a (increasingly larger) neighborhood which is free of any cycles, and 

(ii) the fact that d n grows to infinity, which smooths out irregularities as n tends to infinity. 
Our second result is arguably the most important one in this paper. 



More precisely, even under this narrow growth regime the theorem is valid only for Friedman's permutation model. For d 
fixed, Friedman's model approaches the uniform distribution with increasing n; this is no longer clear once d n grows with n. 
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Theorem 2. Fix 6 > 0. Let d n = (logn) 7 , where 7 > 0. Let rj n — (r n — r~ 1 )/2 where r n = exp(d~ a ) for 
some < a < min(l, I/7). Then there exists an N large enough such that for all n > iV, for any interval 
Jcl of length \L\ > ma,x{2r] n ,i] n /(— <51og<5)} ; 



with probability at least 1 — o(l/n). Here Mi is the number of eigenvalues of ± A n in the interval L, and 
f sc refers to the density of the semicircle law as in 0. 

Remark 1 . Note that the shortest length of the interval / that our methods can narrow down to is of length 
i] n , which is roughly about 1/logn, if d n >• logn, and l/d n , if d n <C logn. For Wigner matrices a far shorter 
scale can be achieved (effectively poly- log over n in [21] )■ Such sharp estimates are not to be expected in 
the graph case, and this again is a consequence of sparsity and lack of concentration estimates. 

Remark 2. A close examination of the proof of Theorem [2] reveals that it can be extended to any deterministic 
sequence of regular graphs of increasing size and degree, as long as the "locally tree-like" property holds at 
"most" vertices. 

Remark 3. Since the submission of this paper, significant progress has been made in proving the local 
semicircle law for random regular graphs in any kind of growth regime for d n (see Tran, Vu, and Wang [65 ) . 
Their methods rely on proving the local semicircle law first for Erdos-Renyi graphs with suitable parameters, 
and then using a result by McKay and Wormald [UJ about the probabililty that an Erdos-Renyi graph is 
regular. Their result subsumes ours (in the sense that the lower bound on the length of the interval / is 
smaller) for the case when d n = f2((logn) 10 ); when d n — o((log n) 10 ), our result is slightly stronger in the 
same sense. 

Although these results are similar up to a point to the Wigner matrix results, our methodology is essentially 
different. Due to sparsity and lack of concentration, we had to adapt a more combinatorial set of tools (in 
particular, the tree approximation) as well as tools from linear algebra to the Stieltjes transform approach 
used in [H] and [50]. 

Several recent articles have done extensive simulations on eigenvalues and eigenvectors of random graphs, 
with surprising conclusions. For example, Jakobson et al. [38J carries out a numerical study of fluctuations 
in the spectrum of regular graphs. Their experiments indicate that the level spacing distribution of a generic 
k-regular graph approaches that of the GOE as we increase the number of vertices. On the eigenvector front, 
in the article [22] by Elon, the author attempts to characterize the structure of the eigenvectors by suggest- 
ing (with numerical observations) that all, except the first, follow approximately a Gaussian distribution. 
Additionally, the local covariance structure has been conjectured to be given by explicit functions of the 
Chebyshev polynomials of the second kind. In particular, if two vertices on the graph are at a distance k 
from each other, it is conjectured that the covariance between the coordinates of any eigenvector at the two 
vertices decays exponentially in k. 

All this empirical data points to universality properties of the adjacency matrices of large, sparse regular 
graphs; we took here a first step toward proving them. 

If the eigenvectors are indeed uniformly distributed over the sphere then they must (with high probability) 
satisfy derealization. We give upper bounds on the probability of this phenomenon. 

We use the following definition of derealization, similar to the one used in [21] . 

Definition 1. Let T be a subset of {1,2,..., n} of size L > 1. Let 5 > be some fixed number. We say 
that a vector v = (v(l), . . . ,v(n)) G M. n with L 2 norm ||«||2 = 1 exhibits (T, 5) localization if 



The vector v is said to be (L, 5) localized there exists some set T C {1,2, ... ,n} such that |T| = L such that 
v is (T, 8) localized. 

Below is our result on eigenvector derealization. 




v 



ItII^EK-?)! 2 ^ 1 - 5 - 



Theorem 3. Assume the set-up of Theorem^ Fix 6 > 0. 
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(i) Let T n C {1, 2, . . . , n} be a deterministic sequence of sets of size L n = o(r/ n 1 ). Letfli(n) be the event 
that some normalized eigenvector of the matrix A n is (T n , 5) localized. Then, for all sufficiently large 
n, 

p«wn>^*(>-.(i))-i-.(i). 

(ii) Define the sequence 

1 loen 
C» = t , f f rv -4, n>2. 
4 log(d n - 1) 

Consider the (random) subset J(n) of all vertices in the graph whose Cn -neighborhood is free of cycles. 
Then, 

P (\AM >1 _vA = 1 _ o (i_ 

V n d nJ \n 

Moreover, there exists an n large enough such that the event that T n C J(n) and some normalized 
eigenvector is (T n ,5) has probability zero. 

Remark 4. More progress has been made on the eigenvector derealization front since the submission of this 
paper. In their paper [65], the authors prove that the £°° norms of all eigenvectors are o(l), regardless of 
the regime of growth of d n . Very recently, Erdos, Knowles, Yau, and Yin have posted a paper |23j proving 
the local semicircle law for Erdos-Renyi graphs with pn — fi(logn) up to a spectral window (an interval /) 
of size larger than 1/n; from this, they could deduce that the eigenvectors of such Erdos-Renyi graphs are 
completely delocalized, i.e. that the £°° norms of the normalized (unit) eigenvectors are at most of order 
1 / \N with high probability. It would be interesting to see if the methods of [55] (of deducing results for 
random regular graphs from the same results for Erdos-Renyi) can be combined with the theorems of [23 
to obtain complete eigenvector derealization (and, potentially, a much smaller spectral window) for random 
regular graphs with d n = fi(logn). 

The bounds in Theorem [3] is not sharp. There are severe technical obstacles in producing sharp bounds 
by adapting the strategy of Wigner matrices. One such example is eigenvalue collision, i.e., the event that 
A n does not have n distinct eigenvalues. Since A n has discrete entries, this event has a positive probability. 
However, to the best of our knowledge, no good bound on this probability is known. 

The paper is organized as follows. In Section [2] we prove the global convergence to the semicircle law 
(Theorem [lj . This is followed by the proof of the local semicircle law (Theorem [2]) in Section [3] The 
eigenvector derealization is proved in Section [4j Finally, Appendix A contains an exact calculation of 
eigenvalues and eigenvectors for the random regular (finite) tree, defined in Section [2] 

2. Global convergence to the semi-circle law 

Recall that G n — G(n,d n ) denotes a random d„-regular graph on n vertices whose adjacency matrix is 
A n . Recall that d n satisfies the asymptotic condition 

(5) lim d n — oo, d n — 1 = n e ", for some e n = o(l). 

n— >oo 

We prove here Theorem]!] namely, that the empirical spectral distribution (ESD) of the adjacency matrix 
A n converges in probability to the semicircle law on [—2,2] which we recall from Q. 

Our main instrument is to use the moment method. Our arguments depend crucially on the following 
local approximation of G n by a rooted tree. Consider the deterministic rooted tree S n , which is the infinite 
regular tree of degree d n with a distinguished vertex marked as the root. For a graph G whose every edge 
is taken to have unit length, consider the induced metric structure on G. We define the r-neighborhood of 
the vertex i, to be the subgraph of G whose vertices are at a distance at most r from i and whose edges 
are all the edges between those vertices. The following lemma makes precise the idea that, except for a 
vanishing proportion of the vertices, the r-neighborhood of any vertex is isomorphic to the corresponding 
neighborhood of the root in the tree S n . 

Recall that a cycle is a sequence of vertices {i\, . . . ,ik} of a graph such that i\ = i}., there is no other 
repeated vertex, and there is an edge between every successive ij and ij+i- The length of the cycle is the 
number of vertices except the initial one. A cycle of length k will be called a fc-cycle. Finally, a cycle-free or 
acyclic graph is a tree. 
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Lemma 4. Fix a positive integer r. Let r(n) be the subset of vertices of G n which have no cycles in their 
r -neighborhoods, and let \t(ii)\ denote the size ofr(n). Then, under the assumptions of we have 



Proof. We use the estimates of McKay, Wormald, and Wysocka [48] on the Poisson approximation to the 
number of short cycles in regular graphs. Let g(n) be a sequence such that 

(6) g(n) > 3, and (d n - l) 2 ^")- 1 = (n). 

For any s < g, let M s denote the number of cycles of length s in the graph G n . It has been shown in [35] 
that M s is approximately distributed as a Poisson random variable and 

E{M s )=fx s (l + 0(s(s + d)/n)) ) where Ms = (d ~ X) , 

(7) 2s 

and Var (M s ) = fj, s + O (s(s + d)/n))/i 2 . 

Consider now the growth of the degree sequence as in (§. If we choose g(n) such that 2g(n) — 1 = 
it will satisfy 

{d n -lf a ^- x =n^ = o{n). 
Additionally g(n) grows to infinity with n, since e(n) = o(l). 

Consider an s-cycle for some s < g(n). It has exactly s-vertices. Now, the number of vertices whose r 
neighborhoods fail to be acyclic because of this s-cycle are precisely those vertices which are at a distance 
of at most (2r — s)/2 from any of the vertices in the s-cycle. The number of such vertices has an easy 
upper bound of 2(d n - l)( 2, '- s )/ 2 s, for all large enough d n . Thus, the total number of vertices whose r 
neighborhoods are not acyclic can be bounded above by 

2r 

(8) n; = Y,Md n -i) {2r - s)/2 M s . 

s=3 

Also, 

(9) n — |r(n)| < N*. 

Taking expectations on both sides of and using formulas Q, we get 

= 2s ( d - l)^-*)/ 2 ^ 1 ^ (l + O (s(s + d)/n)) 

s=3 

The quantity 0(s(s + d) jn) denotes a function h(s, d, n) such that 

h(s,d,n) 

s(s + d) 

remains bounded for all choices of s, d, and n. Thus we get 

2r / 1 2r \ 

en: = (d - i) r J2( d ~ x ) s/2 + H s ( s + - 1 ) r+V2 = ° - 1 ) 2r ) ■ 

s=3 \ U s=3 / 

The last equality is true since, by our assumption on d n , the second term in the sum is o(l). 
Similarly, we can compute the second moment. By the Cauchy-Schwarz inequality 

2r 

Var(JV r *) < 2r^4s 2 (d- l) 2r - s Var(M ;j ) 

s=3 
2r 

< 2r J2 4s2 (d ~ !) 2r ^ [M S + O (s(s + d)/n)) M 2 ] 

s=3 

2r 2r 

< 2r ^ 4s 2 (d - l) 2r - s /i, + 2r ^ 4s 2 (d - l) 2r - s (s(s + d)/n) /i 2 . 

s—3 s—3 
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Plugging in the value of ii s from ^ we get 

It 

Var(iV;) < 4r 2 (2r + l)(d- l) 2r + 2r(d - l) 2r ^(d - l) s O (s(s + d)/n) . 

s=3 

As before, it thus follows that 

2r / 2r \ 



J^(d - l) s O (s(s + d)/n) = O J2( d - 1 ) Ss ( s + d )l n ) 

s=3 \s=3 / 

O (n- 1 (2r + d)(d - l) 2r J2] = (™ _1 (2r + d)(d - l) 2r r(2r + 1)) . 



\ 8=3/ 

Hence 

Var(iV;) < 4r 2 (2r + l)(d- l) 2r + <3(r 2 (2r + d){d- l) 4r /n) . 

Note again that, by our assumption, the quantity (d — l) 4r /n is o(l). 

We now want to use Markov's inequality to bound the tail probability of the quantity 1 — |r(n)| jn. Fix 
any e > 0. Then, by inequality ([9]), we get 

p(l- > e) < P {N; > ne) < -\^E (N* r f 

\ n J n s e z 

= ^\v w {n;) + {e{n;)) 2 

< i [4r 2 (2r + l)(d - l) 2r + r 2 (2r + d)o(l) + O ((d - l) 4r )] 



I .1 ' ' 

7? n 



(d-_l) 

2 



< e- 2 I V ^ : ; ] = e~ 2 fn 4 """ 2 ) 



by our choice of the sequence d n . 
Choosing e = n -1 / 4 we get 

p (l- > n- 1 '^ < V^O (n 4 — 2 ) = o (n- 5 / 4 ) , 

since e n — o(l). This completes the proof of the Lemma. □ 

Lemma 5. Lei {/^, i = 1, 2, . . .} &e a sequence of random probability measures on the real line, defined on 
the same probability space. Let /i be a non-random continuous probability measure supported on a compact 
interval I. Suppose there exits a pair of doubly indexed real-valued sequences {a ra (r), b n (r), r, n € N} such 
that the following hold. 

(1) For every r = 1, 2, . . we have 



P f U n { a »( r ) - / ^n(z) < b n (r)\ j = 1. 

\N=1 n>N L •/ J y 



(2) Lor every r = 1, 2, . . we ftove 

lim a n (r) = lim 6„(r) = / x r d/j J (x) < oo. 
T/ien </ie sequence of measures {/i„} converges to fi in probability. 



Proof. Let fl r be the event 



U Pi \a n (r) < [ x r dfi n (x)<b n (r) 

N=ln>N J 



Then, from condition (1), it follows that 

(oo \ / oo \ oo 

f]fi r Up MJ fi H <E p (^) = °- 
r=l / Vr=l / r=l 

Thus P(n^ 1 fi r ) = 1. 

Consider any fixed realization of the sequence {/x n } <E n£2.)fl r . By Helly's Selection Theorem, this 
sequence has a limit point v. Thus, there is a subsequence {/i nfc } that converges to v in the topology of weak 
convergence. 

Now take r to be a positive integer. We would like to show that 



lim 



x dfi nk 



c T dv. 



From the standard theory of weak convergence, it follows that this will be true if the function x r is uniformly 
integrable under the sequence of measures {p n J. However, uniform intcgrability follows from the following 
L 2 -boundedness condition: 



max/^„ fc <max 6 „ fc( 2,)<oo, 



by conditions (1) and (2). 

In particular, from condition (2) we reach the conclusion 



x r dv(x) 



x r d/i(x), r = 0,1,2,. 



Since the support of \i is the compact interval /, it follows that the Moment Problem has a unique solution, 
and hence, v must be equal to \i. 

This shows that any limit point of any sequence {/x ra } in V(£L-fil r is given by \x. By the usual subsequence 
argument, this shows that \i n converges to fi in the set Cl^^r, and hence with probability one. This proves 
the result. □ 

Proof of Theorem^ Consider the random graph sequence G n = G(n,d n ) as in the statement, and let A n 
be the adjacency matrix of G n . 

Let fi n be the ESD of the matrix (d n — l)~ 1 / 2 A n . Then, for any positive integer r, 



J x r dfi n (x) = itr ((<*„ - l)- r/2 A r n ) = {dn n 1} ^ XXfc 



0. 



Here A n (i,i) is the ith diagonal element of the matrix A r n . 

Note that A n (i,i) counts the number of paths of length r that start and end at i. Consider the set of 
vertices in r(n), as in Lemma|4j whose \r/2\ -neighborhood is acyclic. For any i G r(n), the number of such 
paths, B n , is equal to the number of paths of size r that start and end at the root of the tree S n . If i ^ t(ti), 
we use the trivial bound A n (i, i) < d n . Thus 

i=l 

n- |r(n)L 



(d n -l)-^^Br n < 



b: 



<{d n -l)-r/* B n 

If we define 

(10) a n (r) = (l-n- 1 /^(d n -l)- r / 2 B r n , b n (r) = (d n - l)" r / 
then from Lemma [4] we get 

(11) P U n (r) < J x r dfi n (x) < b n (r)^j > 1 - o (n~ 5 / 4 ) 
In particular, by taking complements of the events above, we get 



1/4W2 



X) P ( 1 OnW ^ / ^M*) < Mr) \ ) < 5>(? 

71=1 \ < ' ' n=l 



-5/4 



< OO. 



Now consider a product probability space on which independent copies of our (countably many) random 
graphs are defined. Applying the Borel-Cantelli Lemma and (111 we get that 



(oo oo , „ 
n u \ a «w ^ / 



(x) < b n (r) M = 0. 



Taking the complements again, we get 



P U fl | a «( ? ') ^ / Xrd »n{x) < b n (r) 
\N=ln=N J 



1. 



This satisfies the condition (1) in Lemma [5] 

Once we show the validity of condition (2) for fj, equal to the semicircle law, we will be done by Lemma 
[5] Clearly, by our choice of d, n as in the statement, and the functions a n (r),b n (r) as in (10), this will be 
true once we establish 



lim (d n - l)- r/2 B r n = ( x r f sc (x)dx. 

n— >oo / 



We only need to verify above for even r, since for odd r, both sides are zero [B r n = since in a tree one cannot 
return to the root in an odd number of steps, and the moment is zero since f sc is a symmetric density) . 

Now, for an even r, the value of B n (r) has been computed by McKay in [55] (denoted by 6(r) in eq. (15) 
on (IFJ). It is given by 

B„(r) = I x r f n (x)dx, 

J-2y/d n -l 



where f n (x) is the Kesten-McKay density 

d n \/4(d n ~ 1 

Thus, changing variable to y — (d n — l)~ x l 2 x, we get 



Jn{x) = o_/^2 _ 2\ ' _2 V "n ~1<X< 2V d n - 1. 



lim (d n ly^Bl = lim - \^ r/? [ 2V ^ x r dn ^ { fz-\ -dx 



(d n - If' 2 J-2^d—i Md: 

,. I 2 r d n y/d n - ly/4-7/ 2 _ w/ 2 , 

~L 2 V 2^d n -m{d n /{d n -i)Y- y y(d n -i)) [dn iJ dy 

2 

y r fsc(y)dy. 

-2 



1 

2^ 



The last equality follows by the Dominated Convergence Theorem and the fact that lini„_ i . 00 d n — oo. This 
completes our proof. □ 

3. Estimating the rate of convergence of the ESD 

This is the longest section of the paper, and it is quite technical, so we provide an outline of the proof. The 
approach we will use is given by the Stieltjes transform of the adjacency matrix of the graph. To estimate 
how far the Stieltjes transform of the graph is from the Stieltjes transform of the semicircle, we will use as a 
stepping stone the resolvent of the adjacency matrix of a finite regular tree, which we will show to be very 
close to both. 

The estimation consists of the following steps: 
Step 0. Basic definitions and properties of the quantities involved (Section 3.1). 

Step 1. Compute the resolvent of the regular tree, and show that, in a certain growth regime for d n , its 

(root, root) elements is close to the Stieltjes transform of the semicircle (Section |3.2| . 
Step 2. Show that, in the same growth regime as before, the (root, root) element of the resolvent of the 

regular tree is very close to the Stieltjes transform of the regular graph (Section 3.3). 
Step 3. Use the estimations from the previous steps to conclude that the Stieltjes transform of the regular 

graph is close to that of the semicircle, and use the methods of [5D] to obtain bounds on the rate of 

convergence of the ESD (Section 3.4). 



9 



3.1. Basic Definitions. 



Definition 2. For a n x n Hermitian matrix A and a variable zgC for which Im(z) > (thus z is not an 
eigenvalue of A), define the Stieltjes transform to be the function 

s(A;z) := -tr(A-z/„) _1 
n 

:= -trM-zr 1 . 
n 

Here /„ is the n x n identity matrix; for convenience, we will drop the identity matrix and use the second 
notation. 

We will also require the notion of Chebyshev orthogonal polynomials of a complex variable. For more 
details, see the book by Mason and Handscomb [44j p. 14]. 
For a complex number z, define 

w = z + \J z 2 — 1, z — - (w + , 

where the square root of a complex number is taken such that the imaginary part is always positive. It can 
be verified easily that for any r > 1, the set 

(12) E r :={z: \w\ = r} 

is an ellipse whose foci are at {+1,-1}. 
Note that 

max = — — ^ , 

z£E r Z 

and that when r = 1, this ellipse degenerates to the interval [—1,1]. 

Definition 3. The nth Chebyshev polynomial of the second kind U n is defined as 



w n+1 — 

U n (z) = — , n = 1, 2, 



(13) 

w — w - 

with Uo(z) = 1. It is easy to check that, in addition, U n (z) satisfies the recursion 

(14) U n (z) = 2zU n - 1 (z) - U n - 2 (z), n=l,2,..., 
with the initial conditions Uo(z) = 1, U-i(z) = 0. 

Remark 5. When r — 1, the above gives us the traditional orthogonal polynomials for the semicircle law on 
the interval [—1,1]. 

We will need the following bound on U n which can be found in [HI eqns. (1.53), (1.55)]: 

(15) ■ r < |l7 n _i(2t)| < r , zeE r . 

Finally, we will need the standard formula for inverses of symmetric block matrices, given below. 

Proposition 6. Let A and D be complex symmetric matrices with sizes n x n, respectively, m x ra, and let 
B be an m x n real matrix. Define the (m + n) x (m + n) complex symmetric matrix 



where B' denotes the transpose o/B. Then 

i lA- 1 + A^BF^B'A" 1 -A^BF" 1 " 
(16) = [ -F-^B'A-i F _1 

Equivalently, by reversing the roles of the blocks A and D, 

im— lm'/n— l im-i i irn— lus'^n— Itoth— 1 



F = D - 



(17) 



-l 



These formulas are easy to verify and their proofs can be found in standard matrix algebra books. 
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3.2. Resolvents of regular and almost regular trees. Fix a positive integer d > 2. Let T be a finite 
ordered rooted tree of depth ( € N such that every vertex has exactly (d — 1) children. That is, the root has 
degree {d — 1), and every other vertex, except the leaves, has degree d. Such a tree is almost regular since 
all vertices, excluding the root and the leaves, have degree d. 

In order to define the adjacency matrix of this graph, we must fix a labeling; we will define this labeling 
recursively down to £ = 0, in which case all we have is a root vertex which we label 1. 

Imagine the tree embedded in the plane. If the depth is zero, the only element is the root, and the 
adjacency matrix is obvious. If the depth is one, the root has (d — 1) children. Consider each child vertex 
as a tree of depth zero, order their adjacency matrices Hi, Hi, . . . , H,i-i from left to right, and consider a 
block matrix with these as the diagonal blocks from upper left to bottom right. Finally add a bottom-most 
row and a rightmost column for the root vertex. 

By induction, suppose we have labeled the adjacency matrix for the tree of depth £ — 1. Consider now 
the tree of depth £. If we remove the root and the edges incident to it, we are left with (d— 1) trees of depth 
£ — 1 arranged from left to right. We consider their (d — 1) adjacency matrices and arrange them as diagonal 
blocks and add the root as the last element. 

Denote by H the adjacency matrix thus obtained. 

Lemma 7. For any complex number z such that > ; and recall the nth order Chebyshev polynomial, 
U n (z). Then the elements of the resolvent of the adjacency matrix H , (1/y/d— 1H — z)^ 1 , have the following 
properties: 

(i) 



yd — \ J roo t root Z + Z ~^~ 



-1 



where the previous refers to a continued fraction of depth £ (i.e., C + 1 recursions) ; 
(ii) the above can also be represented as 

-l 



(18) 



\Vd-l Jroot,root U C+1 (z/2) 



(iii) furthermore, 



(19) 



Proof. 



i 



.Vd-l 

where leaf represents any leaf ofT; 



H-z 



- 1 _ (ri-i)-c/ 2 

root.leaf U C+l( Z / 2 ) ' 



(i) Note that when ( = (i.e., the tree has only the root vertex), the equality is trivially true. 
We proceed by induction. Suppose the equality is true until depth £ — 1. Consider a tree of depth £ 
and label the adjacency H matrix as above. Thus 



(20) 



.H-z 



H 



fd-i 



Hi 



(21) 



Here u is the column vector representing the children of the root. Notice that u is (d— l) -1 / 2 exactly 
at the (d — 1) coordinates which are the last elements in each of the block matrices Hi, . . . , Hd-i 
and zero elsewhere. The vector u' is the transpose of u. 

We now use formula (16) treating the the final element [—z] as one block. Thus if <p(£) denote 
the element on the left side of (i) above, we get ip(C) = F^ 1 , where 

' ^ ip(C-l) = -z-<p((-l). 



F 



i~root 



Here tt i ~ root" refers to the children of the root which are, in their turn, the roots of trees of depth 
£ — 1. The formula now follows by induction. 

ii 



(ii) We will use the three term recurrence formula for continued fractions which we state below. More 
details can be found in the excellent book by Lorentzen and Waadeland O p. 5-6]. Given sequences 
of complex numbers {a n } and {b n } and a complex argument ui, one can define a continued fraction 
function with argument uj by defining 

S n (u) = b + j— j— . . . 7 — ■ — . 

61+62+ b n +uj 

By Lemma 1.1 in [32] we get the existence of complex sequences {A n } and {B n } such that 

A n _iw + A n 



B n -\u + B n 



for n 



1,2,. 



where 



(22) 



A n — b n A n —i + a n A n —2i 



B n — b n B n -i + a„B n -2 
1. 



with initial values A_i = 1, Aq = bo, B_i = and B 

In our case we will take each ai = — 1 and each bi = z, except 60 — 0. The recursions in (22) give 

us 

A n = zA n -i — A n ^2, B n = zB n -i — S„_2 

with the initial values A_i = 1, Aq = 0, £>_i = and Bq = 1. 

Comparing with the recursions of the Chebyshev polynomials U n given in ( 14 ) we get that 

B n (z) = U n (z/2) , A n (z) = -U n -!(z/2). 



Since clearly <p(£) — 5^ + i(0) we get formula (18). This proves part (ii). 
(iii) Since there is an obvious isomorphism of the tree that can exchange the labeling of leaves, it is 
enough to consider the leaf labeled 1 in the adjacency matrix. We will express 

-1 



1 



-H 



in terms of ip(£ — 1). 

Let N be the total number of vertices in the tree, and let A be the diagonal block matrix which is 
the upper left (N — 1) x (N — 1) block in (20). Then from the formula of inverses of block matrices 
we get 

m = -f-\a- i u) u 

where F is defined in (21 ). But F^ 1 is (p(Q an d simplifying the elements of w = A^ 1 u, we see that 
w(l) = (d- l)- 1 / 2 ^^- 1). In other words, V(C) = -( d - 1) _1/ V(C)V'(C - !)■ We get by induction 

M0 = 77E$Lll'P®> C=l,2 



Now we substitute formula (|18|) to obtain (|19|): 

(-1) C 



^(0 = 



C 

n 

i=0 



Ui(z/2) 



U (z/2) 



(d-l)-i/ 2 



U i+1 (z/2) (d-l)t/ 2 U (+1 {z/2) U (+1 (z/2) 



a 



Having now calculated the quantities ip and ip for this "slightly irregular" tree T, let us use them to find 
the corresponding quantities for the regular one, where the root is adjacent (just like all of the other non-leaf 
nodes) to precisely d edges (and thus has d children). We consider the same kind of labeling as before. 

Lemma |8] below is a variation of Lemma [7] above. 

Lemma 8. Let denote a d-regular tree of depth ( G N such that every vertex has degree d. Let Hd 
denote the adjacency matrix of the graph. The entries of its resolvent (1/y/d— lH^ — z) _1 have the following 
properties: 
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(i) 



(23) 



MO 



1 



(24) 



(ii) the above can also be represented as 

1 



= 1/2) 

root,root U C+1 (z/2) - (d - l)" 1 ^-! (z/2) 



1^(0 



where leaf represents any leaf ofT d 



(d-i)-c/ 2 



Proof. The proof is identical to that of the last lemma, except that we need to be careful in the first step of 
the recursion. Since the labeling of the vertices has the same principle as before, we have 



(25) 



1 



1 



where ip(-) has been defined in (21 ). This reflects the fact that the only change from before is in the number 
of children on the root (used to be d — 1, now is d). 



Substituting the value of tp from ( 18 1 we get 

-l 



1 



Vd^l 



H d -z 



= -z + 



d £/<-i(z/2) 



d-1 U c {z/2) 

U<(z/ 2 ) 

-zU c (z/2) + U c ^(z/2) + {d- l)-W C -!{z/2) 

Wg) 

U i+1 {z/2)-(d-l)-W^ 1 {z/2Y 



The final step above follows from the recursion of the Chebyshev polynomials given in (14 1. This proves (i). 
The proof of (ii) follows by a similar argument. □ 

Recall that we are ultimately interested in how close the Stieltjes transform of the d-regular graph is to 
the Stieltjes transform of the semicircle, s(z). Toward this goal, we will need some estimates for the functions 
defined in Lemma |S1 

Lemma 9. Consider the functions defined in Lemmas^ and [#| Then for all z such that z/2 G E r = 



{yeC: \y + ^y 2 ~ 



1 



(26) 



\, for some r > 1 such that r < 1/2, one has the following estimates. 
(i) Consider fd{C) an d s ( z ) = ~( z ~ V z 2 — 4)/2. We have the following estimate 

2r- 2 < 

I _ r -2C-2 



\M0 ~ s{z)\ < C 



1 

d-1 



where Co is a constant. 
(ii) The following bound on ijj(-) holds: 



(27) 



-C-i 



(d- l)</2 1 -r-2C-4- 



(iii) Similarly 



(28) 



\M0\ < Co 



-c-i 



(d- 1)C/ 2 1-r 



-2C-4 ' 



Remark 6. The condition r ^ < 1/2 is a priori more restrictive than necessary for the purposes of Lemma 
[j)} since we will in fact be interested in the case when r _l > = o(l), this does not matter. 
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Proof. (i) Let lu = (z + \/z 2 -4)/2 = -l/s(z). Note that |w| = r > 1. Recall from ((23]) and Q that 

1 l-a;-^- 2 



(29) 



^(0 



; C+2 _ uj-C-2 _ _L_ ( w c - W -C) w l - u;-2C-4 - gl- ( u -a _ w -2C-2) • 



Thus 



\M0\<r- 



1 + r-t- 2 

1 - 7-2C- 4 - (d - l)- 1 ^ 2 + r- 2 C- 2 ) 



as long as the right side above is positive. Since r > 1 we get <Pd(C) i s bounded by an absolute 
constant Co when < 1/2. 



We now estimate the quantity y(C)- By (18) and (13 1 we get 

,C+i 



<p(0 



-C-i 



^1-^-2 



C+2 



-C-2 



1 -CJ-2C-4- 



Thus 



kO + w _1 | = k(0-«(*) 



1-LJ-X- 2 
1 -W-2C-4 



.-2C-3 



1 

1 -W-2C-4 



< 



= r- x 

»-2C-3 
1 - r -2C-4' 



-2C-2 



-2C-4 



1 -W-2C-4 



To get to <y2(27 consider the formula (25 1. Note that 
1 



1 



Thus 



Hence 



z-s(z) z/2 + Vz 2 -4/2 
1 



= — Lo 1 = s(z). 



1^(0-^)1 = 



- ^-^C - 1) 



1 



z — s(z) 



u- l \\M0\ 



<c 



,-2£-l 



d- 1 7 1 -r-2C-2 d- 1 



Since r > 1 this completes the proof. 



(ii) We use the estimate on the Chebyshev polynomials given in (15) and our assumption on z to get 



i 



(d-l)C/ 2 

(iii) This part is similar, since 

\M0\ ' 



U c+1 (z/2) 



< 



-c- 1 1 



(d - l)^ 2 rC+2 - r-C- 2 (d - l)C/2 1 - r -2C-4 ' 



(d- l)C/2 



1 



U c+1 (z/2) 

-2 



1 



< 



l-(d-l)-i[/c_i(V2)/E/c+i(*/2) 
1 

(d - l)C/2 1 - r-2C-4 ^ 1 - (rf - |t7 c _ 1 (^/2)/^ c+1 (^/2)| 



Note that, by way of its definition, the constant Co in part (i) is an upper bound on the final 
term-hence the estimate. 

□ 



3.3. Prom trees to regular graphs. Consider now a (deterministic) (i-regular graph G with a distinguished 
vertex called the root such that, for some £ > 1, the (( + l)-ncighborhood of the root is a tree. That is to 
say, consider the subgraph consisting of all vertices in G whose distance from the root is at most £ + 1 and 
the edges between them; we assume that this subgraph has no cycles. 

This gives us a natural partition of the graph. We denote the tree subgraph induced by the root and all 
vertices of distance at most £ from the root by Td- We denote the "boundary" of this graph, that is, the set 
of vertices that are at distance exactly £ from the root, by dTd- The subgraph induced by the vertices in 
the complement of will be denoted by T^, and its own boundary, that is, the set of vertices at distance 
exactly ( + 1 from the root, will be denote by dT c d . For further clarification, please refer to Figure [I] 

Note that all edges betwen T d and are between dT d and dT d . 

Additionally we will denote the set of vertices of (respectively T^) by V(Td) (respectively V(T d )). 




Figure 1. Illustration of Td, T5, dTd, and dT d starting at a vertex root, with maximal 
tree depth (. Note that the neighbors in of the vertices from dT d may, but need not 
continue the tree-like structure. For this particular tree, d = 4. 



Let A denote the adjacency matrix of this graph and let Hd denote the adjacency matrix of the subgraph 
Td- Label the vertices of Hd as in Lemmas [7] and [8] and write A in the block matrix form 



A 



D B 
B' H d 



Here D is the adjacency matrix of T^, the matrix B records only and all the edges between dTd and dT d , 
and we again use the notation B' for the transpose of B. 

We will now proceed to estimate how close the Stieltjes transform of the regular graph is to that of the 
tree. 



Lemma 10. Fix a complex number z such that z G E r (see (12)) for some r > 1. Let e denote the quantity 

< = \^^=A-z) - { Ji — H d - z 



Vd^l' 



We have the following bound 



root. root 



2Cl 



root, root 



-2C-2 



Proof. Define the vector 



1 



V = B ^f3~i Hd ~ 7 eroot ' 



where e roo t is the vector that puts mass at the root vertex and zero elsewhere. 
Then by using the formula for the inverse of block matrices (17) we get 

(30) e = [ v' ( - 1 D -z- — *— B ( ,1 H d - z] B'\ 

1 ' d-l | \Vd^l d-l Wd-1 J J 

Our first job is to estimate the elements of the vector v. From the definition it is clear that the rows of 
B (and columns of B') are labeled by the vertices of T^. We write B iif to designate the ith row of B. 
We obtain 

Vi = B„ ( 1 H d - z] e root = V* B lk ( 1 H d - z) 

(31) _j 

= ^2 B ik I . H d - z J , by symmetry of H d . 

fc£V(T§) Wd-1 /root,*: 

Note that B^ is positive (i.e., 1) if and only if i g dT d , k £ dT d and i and k have an edge between them. 
Thus 

(i) Vi = unless i € dT d ; 

(ii) when i e 3T^, 

(32) «< = ^ {— ±=H d -z) =M0- 

kedY d ,k~i W« 1 / root.fc 



Here ipd(0 nas been defined in (24). The fact that there is exactly one k <E dT d such that i 
follows from our assumption that the (£ + 1) neighborhood of the root is a tree (see Figure [I]); 
(iii) by counting the number of elements in &T d , we get 



d r - 2 C-2 

(33) \\vf = d(d-l)<\M0\<C* 



I _ r -2C-4> 



where the final estimate is from ( 28 ) . 
Note that the matrix 



' D -z - — !— B ( , 1 H d - z] B' 



Wd-1 " d-l VVd - 1 

is precisely the matrix G appearing in ( 17 ). In particular, it is the top left block of the matrix Ud — 1)~ 1 / 2 A — z] 
Hence, by padding the vector v with extra zeros, we get a vector v such that 

1 -,( 1 , ' 

e = - — - tS . A - z 

d-l Wd-1 

Since the matrix A is real symmetric, it has only real eigenvalues. It follows from spectral decomposition 
of the real, symmetric matrix A, that for any real vector y 

i „ _V* IMI a 



(34) y' A -z) y < 

Observe from (32) that v = ipd(C) e i where e is a real vector of ones and zeroes which is one precisely for 
the labels corresponding to dT d . 

Combining our previous observations, we get 



Vd^i d-iy ^ a ^' KVd^i 
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Now using the bound in ( 33 1 to ( 30 ) we get 

1 



e < 



< 



d- 1 d- 1 $S(z) 



" °d-l \1 -r- 2 C-4 J 



2C 2 



1 _ r -2C-4 

This completes the proof of the lemma. 



r -2C-2 



□ 



We now arrive at our main result about deterministic regular graphs. Consider the set-up as in Lemma 
10 Now a consider a sequence of graphs G n such that G„ is d n -regular. Each G n has a marked vertex called 



the root such that for some sequence {Cn}, the Cn + 1 neighborhood of the root is acyclic in G n . 

Lemma 11. Assume that that sequences {d n } and {Cn} both tend to infinity with n with the following 
restriction. There exists a sequence r n such that 



(35) r n = e d " , for some < a < 1, and r n ^" = o(l/d n ). 

Then, if A n denotes the adjacency matrix of the graph G n , for all d n such that d n > (log2)~ 1 / c 

1 



have 



(36) 



A,, 



s(z) 



0(l/d n 



root, root 

where s(z) is the Stieltjes transform of the semicircle law s(z) = —z/2 + \f z 1 — 4/2. 

Proof. Under our assumptions certain simplifications are immediate. The constant Co appearing in Lemma 



[9] (and later) can be taken to be an absolute constant. Since 
in (l26l) such that 



"Cn 



o(l/ dn), we can choose Co large enough 



(37) 



|^c2„(Cn) - s i z n)\ < C /d n . 



Here z n is any sequence of complex numbers such that z n /2 belongs to the ellipse E Tn defined in (12 1. 
Note that 

-1 



Vd„ (Cn) 



1 



Vdn ~ 1 



A r . 



We now use Lemma [TOl 
Consider now 



S(z) 



e n 



One can easily verify the inequality 
(38) 



>x/2, for all x < log 2. 



Applying this inequality above, we get ?s(z) > d n a /2 for all d n as stated in the lemma. 
Combining with Lemma [lO] we get 



lei < 2Ci 



1 



1 - r, n 



^ i d%<C^dl = o{l/d n ). 



Combining with (37) this completes the proof of the theorem. 



□ 



We will need the following lemma to show that the previous "tree approximation" result about determin- 
istic regular graphs can be applied to the random graph by choosing almost any vertex as the "root" . 



Lemma 12. Let d n = (logn) 7 , for some positive 7. Let {r n } be as in Lemma 11 Let r\ n be defined as 
(r n — r n 1 )/^- F° r some (3 > 1, we define the sequence {Cn, n — 1, 2, . . .} satisfying 

1 logn 

4l0g(dn-l) 
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(39) 



Cn 



Let J(n) be the set of vertices in G n whose £ n -neighborhoods are acyclic and let \ J(n)\ denote its size. Let 
fl(n) be the event 

(40) \J(n)\/n>l- Vn /d n . 
One can choose (3 no larger than 4 such that 

1 



(41) P(n c (n))<o 

Proof. The proof of this result is very similar to the proof of Lemma [4] We again use the estimates of 
McKay, Wormald, and Wysocka [H] on the Poisson approximation to the number of short cycles. Consider 
a sequence {Cn} as in the statement. It is clear from the choice that 

(42) 4C„log(d„-l) = logn-/31og(d n -l), or 

(43) (d„-l) 4C " = {dn n _ lY =o{n). 

Thus, we can take g — 2Q n in ([6|. 

The argument is essentially the same as the one used in the proof of Lemma |4j rather than repeating it, 
we choose to only highlight the differences. 

The total number of vertices whose (^.-neighborhoods are not acyclic can be bounded above by 

Nl=J^s{d n -l)^-^M s , 

s=3 

where M s is the number of cycles of length s. 
Taking expectations and variances above we get 

EN* Qn = O ((d n - 1) 2C ") and 
Var(AT*J < £(2Cn + l)(d n - 1) 2C " + C*(2C« + d n )o{l) . 
We now want to use Chebyshev's inequality on the quantity 1 — |</(?i)| /n: 



p(l-^>^) < P(Nl>n Vn /d n ) 



< 3<%£(d n -l)^ | 3CX°(1) | d\ f {d n -l)^ 
n 2 r) 2 n2r ln nr ln V n 



Let us analyze the three terms that appear above. We use the inequality (38) to obtain rj n > d n a /2. 
Using ( 42 ) we get 

3<%Cn(<*n - 1) 2C " < o ,2X3 1 .,2c poly-log n 

nPrj* ~ " " (d n - l)P/» n 2 n n 3 / 2 ' 

3«o(l) 12<X^°o(l) poly-log n 

2^ - 2 = 2 



The leading term is the last term on the right which is of the order d\ ^ /(nr^), because by (42) we get 
(d n - l)^/n = (d n - 

We now choose f3 > 2a + 2 (and thus no larger than 4, since a < 1) such that 

^ < = o(i). 

Vn 

This completes the proof of the lemma. □ 
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3.4. The Stieltjes transform. We bring now all the results of the previous sections together. Below is the 
first theorem of this section. 

Theorem 13. Consider a sequence of d n -regular graphs on n vertices where d n — (logn) 7 for some 7 > 0. 
Let A n be the adjacency matrix of the graph and consider the Stieltjes transform 

-1 



s n (z) = tr 



1 



--At. 



> 0. 



. y/dn - 1 

Let rj n = (r n — r~ )/2 where r n — exp (d~ a ) for some < a < min(l, I/7). 

Let U n denote all complex numbers z such that $s(z) > r\ n . Then there is a large enough constant C > 
such that the Stieltjes transform of the empirical eigenvalue distribution of the n x n matrix A n satisfies 



P I sup \s n (z) - s(z)\ > C/d n < o(l/n) 



Here s(z) refers to the Stieltjes transform of the semicircle law. 



Proof of Theorem \13\ We condition on the event f2(n) which, from Lemma 12 happens with a probability 
of at least 1 — o(l /n). 



Consider the set J{n) from the Lemma 12 and write 



(44) 



s6J(n) 



1 



-A n z 



k.k 



- E 

k<£J(n) 



-A n z 



k.k 



ovc 



Now, the spectral norm of = A n — z^j is bounded above by l/3(z), which in turn is bounded ab 

by ry" 1 for all z £ U n . Thus, for all z £ U n and all k, we have the obvious bound ^ A n — z^j < i]^ 1 



Summing up over all k ^ J(n), we get 



1 E 



y/d n -l 



-At, 



k.k 



< n-\J{n)\ < J_ 
~ nrj n ~ dj 



where the final inequality holds on the event Q. Thus, we get 



s n{z) = ~ V 
11 — ' 



keJ(n) 



1 



y/d n - 1 



-At, 



O 



k.k 



Now, on the event every k £ J(n) can be considered as the root for a d„-regular tree of depth £ n . 

We can now apply Lemma [TO] and Lemma [TTj Note that, technically, the Cn in Lemma 12 and the £ used in 
Lemma 10 differ by at most one; however, this does not affect the following calculations. 

We verify the assumptions in Lemma |11| 



= exp 
= exp 
< Cexp 



1 .-a jog" 
4 " log(d„- 

1 (logn) 1 "" 7 

"41og(d„ - 1) ^ (logn)« 

(logn) 1 "" 7 



47 log log n 



< exp (-7 log log n) 



d n 



whenever a"f < 1. 

We now combine our error estimate (36 1 to see that for any k £ J(n), on the event f2(n), and for all 
z £ U n , we have 



(45) 



1 



y/d~^l 



A, 



z I (k, k) — s(z) 



O 



1 



where the constants in the O(-) above does not depend on k. 
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Combining this with the decomposition ( 44 ) we get 
(46) sup \a n (z) - s(z)\ = O 

z£U n 

on the event Q,(n), which holds with probability at least 1 — o(l/n). This completes the proof of the 
Lemma. □ 

Recall now the setup of Theorem [2j Fix 6 > 0. Let d n ,r] n be as in Theorem 13 Then we will show 
that there exists an N large enough such that for all n > N, for any interval I C R of length |/| > 
max{2r?„, rj n /(-6 log 6)}, 

■A/z — n f sc (x)dx 



< 5n \I\ 

J i 

with probability at least 1 — o(l/n). 

Here A/j is the number of eigenvalues of W n in the interval /, and f sc refers to the density of the semicircle 
law as in ([!]). 

Proof of Theorem^ Theorem 1 13| leads to Theorem [2] whose proof follows almost identically to Lemma 60 in 
the article by Tao and Vu |60| . The only major difference between our theorem and Lemma 60 of |60| is the 
fact that our interval I can lie anywhere on the real line and not restricted to a subset of (—2, 2) as Lemma 
60 requires. We provide an outline of the argument but skip the details. 

The idea lies in the observation that a good control over the Stieltjes transform near the real line allows one 
to invert the transform and have an estimate of the empirical spectral density. This is due to the following 
inversion formula: if G is a continuous distribution on the real line with Stieltjes transform sg, one gets 

1 [ b 

G[a, b] = lim — / Q (sg(x + ie)) dx. 

71" J a 

Fix an interval I C [—2, 2] such that |/| > 2?/. Define the function 



J I Vn 



Vn 



+ (y - x) 2 



-dx. 



Then it follows that if A; denotes the ith eigenvalues of the matrix 



A„, then 



n * — ' 7r j n * — ' ?7~ 

i=l J 1 i=l 



(Xi - xf 
3 (s n (x + irfn)) dx. 



dx 



_ 1 

7T Jl 

Also, if f sc denotes the density of the semicircle law, we get 

r-2 1 
-2 7T 

Using the approximation between s n and s obtained in Theorem [13] we get 

\s n (x + irin) - s(x + irj n )\dx 



3 (s(x + irj n )) dx. 



-X>(A,)- f F(y)f sc (y)dy < - f 

n ~{ J-2 K Jl 



< C 1 - 



with probability 1 — o 



Choose n large enough such that l/d n < 6 for all subsequent n. Thus 



- F(Xi) = f F(y)f sc (y)dy + 0(5 \I\ 



with probability 1 — o(l/n). 



2(1 



Now following the bounds in [BHl page 60, Proof of Lemma 64] we obtain the bounds 



F(y)fsc(y)dy = / f sc dy + O [r) n log — 



Putting all these together we obtain that with probability 1 — o(l/n) one has 

I/I 



Afi-n f sc dy = O (nS \I\) + 0[ nr) n log 

Ji V Vn 

Finally, as observed in Tao & Vu [BD], the latter term can be absorbed in the former since |/| > T) n /5log{l/S). 
This completes the proof of the Theorem. □ 

4. Delocalization of eigenvectors 

Closely related to approximation of the empirical spectral distribution is the fact that the L 2 -norm of a 
normalized eigenvector restricted to a large subset of the vertices cannot be small. We follow Definition [T] 
and prove Theorem [l] 

Recall the set-up of Theorem [5] Fix S > 0. 

Let T n C {1,2, ... ,n} be a sequence of sets of size L n = o{r)~). Let f2i(n) be the event that some 
unit-norm eigenvector of the matrix A n is (T ra ,5) localized, that is, 

Oi(n) = {3 i : ||wi|T,J| 2 > 1 — S, for some Vi such that A n V{ = XiVi} . 

Then, for all sufficiently large n, we will show that 



P«n l( n)n>e-W4. (l-o(i-)) 



= 1-0 



Proof of Theorem^ Consider the set J(n), defined in Lemma 12 of vertices whose Cn-neighborhoods are 
acyclic. Recall the event Cl(n), whose probability (as soon as n is large enough) is 1 — o(l/n), which is that 
\J{n)\/n > 1 - rj n /d„. 

We first prove part (i). The event fii(n) can be decomposed as two disjoint events depending on whether 
the set T n is a subset of J{n) or not. We first examine the event 

fl' 1 (n) = {T n C J(n)}nni(n) , 

for purposes of exclusion. 

Assume u) € J7(n) n Q[(n), and fix i, Uj depending on u and n. The matrix A n has n real eigenvalues and 
corresponding eigenspaces. The top eigenvalue is d n with corresponding eigenvector V\ = n~ x / 2 l, which is 
completely delocalized. Let us now choose V2, W3, ■ • ■ , v n to be a set of normalized eigenvectors, corresponding 
respectively to the eigenvalues A2, A3, . . . , A„ of ^ /rf 1 _ 1 A„. Fix a subset T n C {1, 2, . . . , n} of size L n as stated 
in the theorem. 

Consider again the Stieltjes transform of the matrix (d n — l) -1 ' 2 A n as in the proof of Theorem 13 That 

is, for z£C, with > 0, consider the matrix ( v / d 1 1 A n — zj . Then by the spectral representation it 
follows that for any 1 < k < n we get 

1 V 1 v 2 (k) 



. Vd n - 1 / fri (Aj - Z) 

Summing over the vertices k £ T n , we get 



-^75— T^- Z J (fc ' fc) = S (Aj - 2) 



(47) ^ -==A„-z 

fceT „ Wan 1 / j=1 
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Taking the imaginary part on both sides we get 

-l 



(48) 



\feeT„ 



j=l fceT„ 



> £ ^ 2 (fc)Q (A . _ z) -i > (1 _ jjcj (Ai _ 



Now we use the fact that for ui £ f2(n) and for all k £ J(n), we get from equation (45) that 



sup 

zG(7„ 



An-z) (k,k) — s(z) 



< 



C 



for some absolute constant C > 0. Recall that s(z), the Stieltjes transform of the semicircle density, is given 

by 

s(z) = - (-z+y/z 2 -£) . 
Thus, summing up over all k in T n we get 



sup 

zGS„ 



E 



1 



A n -z\ (k,k)-L n s(z) 



< 



Combining this estimate with (48 1 for z such that 5R(z) = and Q(z) = rj n we get 

L n ^{s{z)) + CL n /d n >Ti- x {l-8) 

for all cj £ Q(n) n ST^n). 

Since 3(s(z)) is bounded and L„ = o(t7^ 1 ), there is a large enough TV such that for all n > N, this 
inequality will not hold when lu £ Cl'^n). So we get that 

(49) P(fi(n)nni(n)) = 0, 

as soon as n is large enough. 
We can now write 

fti(n) = ((^(?i)) c nfii(n)) U (n(n)nOi(n)) ; 

the probability of the first of the two events above is bounded by the probability of (il(n)) c which is o(l/n). 
Further, by (49), we get that 

n(n)nfii(n) = fi(n)n(fii(n)\ni(n)) . 

Note that the last event is equivalent to saying that O(n) and fii(n) happen, and that in addition T„ 2 J(n). 
We now bound 

P[f2(n) n (O x (n) \Oi(n))] < P [Q(n) n {T„ £ ■%)}] 

= P[{T„£ J(n)} | Q(n)]P[Q(n)] 
1' 



1 - o 



P[{T n % J{n)} | fl(n) 



Note now that, given the size j = |J(n)|, any set of j labels chosen from {1, . . . ,n} is just as likely as any 
other, and independent of the set T n . So 

P [{T n % J(n)} | \J(n)\ =j] = l-P \{T n C J(n)} \ \J(n)\ = j] 

= , (T-t) 



(50) 
Since 



< 1 



P[{T n %J(n)} | n(n)]=P 



{ T B 2J( B )}|J«>l-£ 



it follows from (501 that 



P[{T n %J{n)} | n(n)] 



< 



1-1- 



and since L n — o(l/rj n ) and thus L n /n = o(rj n /d n ), it follows that 
(51) P[{T n £J(n)} | n(n)] < 1 - e^-W^ 

Putting all of these together, we conclude that 



n 



1 - o 



P[toi{n)] < 1 



-L„7)„/d„ 



which completes the proof. 



Note that part (ii) is proved in ( 49 ) 



□ 
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Appendix A. Eigenvalues and eigenvectors of regular trees 

Our main step in the above proofs was to understand the Stieltjes transforms or the resolvent matrix 
of a finite tree where every non-leaf vertex has (d — 1) children. It is quite straightforward to compute all 
the eigenvalues of such a tree. Explicit eigenvalues of the tree give us ideas about spacing distribution of 
eigenvalues of the random regular graph. Fix a positive integer d > 2. 

Lemma 14. Let T be a finite ordered rooted tree of depth ( e N such that every vertex has exactly (d — 1) 
children. That is, the root has degree (d — 1), and every other vertex, other than the leaves, has degree d. 
Let H denote the adjacency matrix of this graph. 

(i) Then, for any complex number z the characteristic polynomial of H is given by 
A(z;C) := det (zI--^=h\ = U c+1 (z/2) {[ U^-^" (*/2). 



(ii) The eigenvalues of the adjacency matrix are given by the following collection. Consider i = 1, 2 . . . , £; 
then twice the zeros of the Chebyshev polynomial U^+i—i appears with multiplicity [d— l) 1 — (d— 1) 4_1 . 
For i = 0, the multiplicity is one. 



Proof. Recall the recursive labeling of vertices as given in Section 3.2 



To prove conclusion (i), note that when £ = (i.e., the tree has only the root vertex), the equality is 
trivially true. We proceed by induction. Suppose the equality is true until depth £ — 1. Consider a tree of 
depth C and label the adjacency H matrix as above. Thus 

l 



(52) 



1 



H = 



fd-l 



Hi 



-H 2 



/d-l 



H 



d-l 



Here u is the column vector representing the children of the root. Notice that u is (d — l)" 1 / 2 exactly at 
the (d — 1) coordinates which are the last elements in each of the block matrices Hi, . . . and zero 

elsewhere. The vector u' is the transpose of u. 
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We now use the following well-known formula of determinant of block matrices (akin to (16l): 

\A B] 



det 



C D 



det (A) det (D - CA^B) 



We apply this to the matrix z — (d — 1)~ 1 I 2 H treating the the final element [z] as one block: Note that, 
by our labeling, in this case A is a diagonal block matrix, and hence its determinant is a product of the 
determinants of the individual blocks which are all the same and equal to A(z; £ — 1). Thus we get 



A(z;C) : 

As shown in Section |3.2[ the quantity 



(A(z;C 



(z-u'A^u) 



i'A~ 



= z + ip((-l) = z- 



E7c-i(«/2) _ zU c - U c -! _ U c+1 {z/2) 



U c (z/2) 



U c (z/2) 



Here U n is the Chebyshev polynomial of the second kind. Note the reversal of sign from Section 3.2 which 
is due to current reversal of sign from the resolvent matrix. 
Hence 



^•-=0 = %^#(A(*;C-l))*- 1 = n 



U 



t+l-i 



(z/2) 



Udz/2) v V ~" ~" i L l U C -i(z/2) 
The last term can be verified from the initial conditions of the Chebyshev polynomials. 
Simplifying a bit more, we get 

A(z; C) = U c+1 (z/2) H Ugg-W' 1 (z/2). 

i=l 

For part (ii), note from above that the i many zeroes of Ui appear with multiplicity (d — X) % — (d— 1)* 
Hence the total number of eigenvalues are 

C C 

c + 1 + £(c + 2 - 1) [(d - iy - (d i)*- 1 ] = ir 

i=l 



i=0 



which is the total number of vertices of the tree. 

The zeros of Chebyshev polynomials of order k can be easily shown to be given by 



□ 



COS 



fc + 1 



j = l,2,...,k. 



Thus an interesting phenomenon transpires in this analysis. If we drop the multiplicities and consider 
the empirical distribution of the distinct eigenvalues, they are the zeros of the Chebyshev polynomials of 
increasing order. These zeros are cosine transformations of equidistant points on the unit circle; and hence 
their empirical distribution converges to the arc-sine law. However, the entire empirical spectral distribution 
converges (see [13]) to the spectral distribution of the infinite tree which is the semicircle law. The effect 
of the multiplicities is strong enough to flip the "smile" of the arc-sine law to the "frown" of the semicircle! 
Also note that the gap of the spectrum from 2 is about twice of 7r 2 /(£ + l) 2 , and does not depend on d. 

Some facts about eigenvectors of this tree are also easy to derive and might be also worthwhile to look 
at. For example, by the spectral theorem, one can write 



(53) 



U c+1 (z/2) 



1 



H-z 



E 



| Pi ^root | 

A,- - z 



Here the sum on the right goes over distinct eigenvalues of the adjacency matrix and Pie roo t refers to the 
projection of the vector e roo t on the eigenspace corresponding to Aj. 

Notice that the above is a meromorphic function of z. From the leftmost expression in (53), it is obvious 
that the function has poles at (twice) the zeros of C^+i- It follows then that Pie roo t is zero for all eigenvalues 
except when Aj is twice of a root of f/f+i. However, these roots are simple, as we show in the previous 
lemma. Hence, ||-Pje roo t|| is precise the square of the 'root'-coordinate of the ith eigenvector. 
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Its value can be easily computed. For any root Aj of f/f+i, we get 

IIP II 2 v( on ^(^2) 2C/ c (A l ) 
H-nerootll = lim (z - 2A^ 



^2A 4 V v *7 c+1 (*/2) C^+i(Ai)" 
Here £/£ +1 refers to the derivative of the polynomial L^+i. Other coordinates can be similarly derived. 
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